CUDA optimization strategies for compute- and memory-bound neuroimaging algorithms

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CUDA optimization strategies for compute- and memory-bound neuroimaging algorithms

As neuroimaging algorithms and technology continue to grow faster than CPU performance in complexity and image resolution, data-parallel computing methods will be increasingly important. The high performance, data-parallel architecture of modern graphical processing units (GPUs) can reduce computational times by orders of magnitude. However, its massively threaded architecture introduces challe...

متن کامل

Memory Locality Exploitation Strategies for FFT on the CUDA Architecture

Modern graphics processing units (GPU) are becoming more and more suitable for general purpose computing due to its growing computational power. These commodity processors follow, in general, a parallel SIMD execution model whose efficiency is subject to a right exploitation of the explicit memory hierarchy, among other factors. In this paper we analyze the implementation of the Fast Fourier Tr...

متن کامل

Memory Memory Memory Compute Processor Compute Processor Compute Processor Interconnection

Many scienti c applications that run on today s multiprocessors such as weather forecast ing and seismic analysis are bottlenecked by their le I O needs Even if the multiprocessor is con gured with su cient I O hardware the le system software often fails to provide the available bandwidth to the application Although libraries and enhanced le system interfaces can make a signi cant improvement w...

متن کامل

Performance Optimization Strategies for Transactional Memory Applications

Transactional Memory (TM) has been proposed as an architectural extension to enable lock-free data structures. With the ubiquity of multi-core systems, the idea of TM gains new momentum. The motivation for the invention of TM was to simplify the synchronization of parallel threads in a shared memory system. TM features optimistic concurrency as opposed to the pessimistic concurrency with tradit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computer Methods and Programs in Biomedicine

سال: 2012

ISSN: 0169-2607

DOI: 10.1016/j.cmpb.2010.10.013